MIRSOFT: mediator for integrating and reconciling sources using ontological functional dependencies
Authors
Abstract
Providing automatic integration solutions is key to the success of applications that manage massive amounts of data. Two main problems stand out in the major studies: (i) managing the heterogeneity of sources and (ii) reconciling query results. To tackle the first problem, formal ontologies are used to make the semantics of data explicit. The reconciliation problem consists in deciding whether different identifiers refer to the same instance. Two main trends emerge in the reconciliation process: (i) assuming that different source entities representing the same concept have the same key, a strong hypothesis that violates the autonomy of sources; and (ii) using statistical methods that identify affinities between concepts, which is not suitable for sensitive applications. In this paper, we propose a methodology for integrating sources that reference a shared domain ontology enriched with functional dependencies (FDs). The presence of FDs gives sources more autonomy when choosing their primary keys and allows a reconciliation key to be derived for a given query. The methodology is then validated using the Lehigh University Benchmark (LUBM).
Copyright © 2012 Inderscience Enterprises Ltd.
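The abstract does not spell out how a reconciliation key is derived, so the following is only a minimal sketch of one plausible reading, not the authors' algorithm. It uses the textbook attribute-closure computation over functional dependencies to find the smallest sets of attributes returned by a query that determine all properties of a class. The function names (`closure`, `reconciliation_keys`), the attribute names, and the two example FDs are all hypothetical and chosen purely for illustration.

```python
from itertools import combinations


def closure(attrs, fds):
    """Return every attribute functionally determined by `attrs` under `fds`."""
    result = set(attrs)
    changed = True
    while changed:
        changed = False
        for lhs, rhs in fds:
            # Fire an FD when its left side is covered and it adds something new.
            if lhs <= result and not rhs <= result:
                result |= rhs
                changed = True
    return result


def reconciliation_keys(available, target, fds):
    """Return the smallest subsets of `available` whose closure covers `target`."""
    ordered = sorted(available)
    for size in range(1, len(ordered) + 1):
        found = [set(combo) for combo in combinations(ordered, size)
                 if target <= closure(combo, fds)]
        if found:
            return found
    return []


# Hypothetical FDs attached to an ontology class (illustrative only).
fds = [
    (frozenset({"email"}), frozenset({"name", "dept"})),
    (frozenset({"name", "birthdate"}), frozenset({"email"})),
]
all_properties = {"email", "name", "dept", "birthdate"}
query_attributes = {"name", "birthdate", "dept"}  # the query does not return email

keys = reconciliation_keys(query_attributes, all_properties, fds)
print(keys)
```

In this toy setting, no source has to expose `email` as its primary key: any two results that agree on the derived attribute set can be treated as the same instance, which is the kind of autonomy the abstract attributes to FDs. The exhaustive subset search is exponential in the number of attributes and is only meant for small illustrative cases.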
Similar resources
Uncertain Data Integration Using Functional Dependencies
Data integration systems are crucial for applications that need to provide a uniform interface to a set of autonomous and heterogeneous data sources. However, setting up a full data integration system for many application contexts, e.g. web and scientific data management, requires significant human effort which prevents it from being really scalable. In this paper, we propose IFD (Integration b...
Pay-As-You-Go Data Integration Using Functional Dependencies
Setting up a full data integration system for many application contexts, e.g. web and scientific data management, requires significant human effort which prevents it from being really scalable. In this paper, we propose IFD (Integration based on Functional Dependencies), a pay-as-you-go data integration system that allows integrating a given set of data sources, as well as incrementally integra...
Extensible Ontological Modeling Framework for Subject Mediation
An approach for extensible ontological model construction in a mediation environment intended for heterogeneous information sources integration in various subject domains is presented. A mediator ontological language (MOL) may depend on a subject domain and is to be defined at the mediator consolidation phase. On the other hand, for different information sources different ontological models (la...
Ontology Functional Dependencies
We extend traditional functional dependencies (FDs) for data quality purposes to accommodate ontological variations in the attribute values. We begin by formally defining a novel class of dependencies called ontological FDs, which strictly generalize traditional FDs by allowing differences controlled by an ontology database. The ontology databases contain information about synonyms. We then foc...
Reconciling Inconsistent Data in Probabilistic XML Data Integration
The problem of dealing with inconsistent data while integrating XML data from different sources is an important task, necessary to improve data integration quality. Typically, in order to remove inconsistencies, i.e. conflicts between data, data cleaning (or repairing) procedures are applied. In this paper, we present a probabilistic XML data integration setting. A probability is assigned to ea...
Journal title:
- IJWGS
Volume 8, issue
Pages -
Publication date: 2012